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PROCESS FOR IN-VITRO PRODUCTION OF RECOMBINED POLYNUCLEOTIDE 
SEQUENCES, SEQUENCE BANKS AND SEQUENCES THAT ARE THUS OBTAINED. 



This invention relates to a method for in-vitro production of recombined polynucleotide 
sequences. The object of the invention is most particularly to generate and then to select 
polynucleotide sequences that are likely to exhibit advantageous properties relative to reference 
sequences and are therefore able to impart a new phenotype and/or to produce new proteins. 

Different techniques have been developed to promote the in-vitro recombination between 
different polynucleotide sequences; among the latter, it is possible to cite more particularly DNA 
shuffling (1 1) and StEP (13), both based on the use of the PCR. 

DNA shuffling comprises two stages, the random fragmentation by DNAse I of 
polynucleotide sequences, and an amplification by PCR in which the fragments that are produced 
above are used as primers. With each hybridization stage, the change in matrix brings about 
recombinations with regard to the regions that have homologous sequences. A diagrammatic 
representation of this method is provided in Figure 1 A as an attachment. 

StEP consists in mixing different polynucleotide sequences that contain various 
mutations in the presence of a pair of primers. This mixture is subjected to a PCR- type reaction, 
in which the stages of hybridization and polymerization are combined in a single stage of very 
short duration. These conditions allow the hybridization of the primers but reduce the 
polymerization speed, such that the fragments that are partially synthesized are randomly 
hybridized onto the polynucleotide sequences that carry the various mutations, thus allowing the 
recombination. A diagrammatic representation of this method is provided in Figure IB, attached. 

In each of these two methods, the polymerization stage is essential to the recombination 
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process. Thus, based on selected polymerases, this polymerization stage can generate 
undesirable further mutations. In addition, starting from a certain number of cycles, the DNA 
shuffling and the StEP rest on the principle of the hybridization of a "mega-primer" (6) onto a 
matrix, which probably brings about difficulties of use for polynucleotide sequences whose size 
is greater than 1 .5 Kpb (10). Finally, these two techniques do not make it possible to monitor the 
rate of recombinations, since the latter are made randomly during successive polymerization 
stages. 

The specific object of this invention is to remedy the drawbacks above by offering a 
simple method for preparation of recombined polynucleotide sequences, making it possible to 
generate polynucleotide sequences that can exhibit advantageous properties relative to reference 
sequences and are therefore able to impart a new phenotype and/or to produce new proteins. 

This object is attained thanks to a process for in-vitro production of recombined 
polynucleotide sequences starting from a bank of polynucleotide sequences, characterized in that 
it comprises the following stages: 

a) the fragmentation of an initial bank of double-strand polynucleotide sequences, 

b) the denaturation of fragments obtained from stage (a) optionally in the presence of 
one or more assembly matrices, 

c) the hybridization of said fragments in the presence of one or more assembly 
matrices if the latter is not (are not) present in stage (b), 

d) the ligation of said fragments, 

e) the cloning of recombined polynucleotide sequences. 

The process of the invention can also comprise one or more of the following 
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stages: 

- the repetition of stages (b), (c) and (d) at the end of stage (d), 

- the separation of the recombined polynucleotide sequences from the assembly 
matrix or matrices before stage (e), 

- the amplification of the double-strand recombined polynucleotide sequences 
before cloning stage (e). 

The ends of the fragments generated in stage (a) are such that there can be adjacent 
hybridization of these ends onto the assembly matrix or matrices in stage (c) and ligation of these 
fragments with one another in stage (d). The polynucleotide sequences of the initial bank should 
exhibit areas of homology either between one another or with the assembly matrices, making it 
possible to generate ends of fragments as described above. 

An advantageous embodiment of the process of the invention consists in producing 
stages (c) and (d) simultaneously according to a reaction called RLR for the English expression 
of "Recombining Ligation Reaction." 

In addition to the advantages indicated above, the process of the invention is noteworthy 
in that it promotes and accelerates the random in-vitro recombination of polynucleotide 
sequences, whereby these polynucleotide sequences can be genes. 

The in-vitro recombination of polynucleotide sequences of the initial bank by the process 
of the invention thus makes it possible to obtain a new bank that contains sequences that have 
accumulated the characteristics of the sequences of the preceding bank. The process of the 
invention therefore constitutes an in-vitro evolution technique. 

The process of the invention constitutes an alternative to recombinatory PGR such as that 
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used in the techniques of DNA shuffling (1 1) or of StEP (13), since it does not require an in-vitro 
polymerization stage to ensure the recombination. On the contrary, the key stage of the process 
of the invention is high-temperature ligation stage (d) on an assembly matrix, which ensures a 
very high degree of accuracy during recombination events. 

The fragmentation of these polynucleotide sequences in stage (a) makes it possible to 
control with precision the desired degree of recombination and the position of recombination 
points. Thus, the higher the number of fragments generated per sequence, the higher the number 
of fragments necessary for recomposing a sequence, which brings about a high recombination 
level. In addition, the nature and the position of the ends of the fragments generated in this 
embodiment of the process of the invention are known and controlled, which makes it possible: 

- to control with precision the zones where the recombination took place, or 

- to induce the recombination between polynucleotide sequences, for example 
genes, if the ends of fragments are created in homology zones between these 
sequences, or homology zones between these sequences and the assembly 
matrix or matrices. 

The process of the invention can be applied to the orientation of the free-end multi- 
molecule ligation. \n this application, oligonucleotides that are just complementary to end 3' of a 
fragment and 5' of the adjacent fragment, which allows the adjacent hybridization of these two 
ends onto the same matrix after the denaturation stage, are used as an assembly matrix in stage b 
or c. Once hybridized, the ends of the fragments can be ligated together so as to orient the 
ligation direction of the free-end fragments. 

The initial bank of stage (a) can be generated by any method that is known to one skilled 
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in the art, for example from a savage gene by mutagenesis stages that are directed in succession, 
by "error-prone" PGR (2), by random chemical mutagenesis, by random in-vivo mutagenesis, or 
by combining genes of close or separate families within the same type or different types so as to 
use a variety of polynucleotide sequences in the initial bank. 

Among these techniques, the invention considers more particularly a process in which the 
initial bank of double-strand polynucleotide sequences is obtained by a polymerization chain 
reaction that is carried out under conditions that make it possible to create random mutations. 

According to a preferred embodiment of the process of the invention, stage (a) consists in 
subjecting the initial bank to hydrolysis by the action of one or more restriction enzymes. 

To increase the degree of recombination generated by the process of the invention, it is 
sufficient to increase the number of restriction fragments by using restriction enzymes that have a 
large number of cutting sites on the polynucleotide sequences of the initial bank, or by combining 
several restriction enzymes. In the case of use of a thermostable ligase, the limit is set by the size 
of the smallest fragment thus generated, for example greater than or equal to 40 pb, so as to 
preserve a hybridization temperature that is compatible with that of ligation stage (d), which is 
generally on the order of 65'^C. 

Stage (a) can also be carried out by generating a bank of fragments by random treatment 
with the DNAse I of an initial bank of double-strand polynucleotide sequences that are partially 
heterologous. This implementation of the process of the invention would have the particular 
feature of making possible the use of DNAse I fragments such as matrices for one another, for 
hybridization during stage (c) or the simultaneous RLR reaction of stages (c) and (d). 

A similar implementation of the process of the invention can be carried out by combining 



6 



with stage (b) at least two banks of separate fragments generated separately in stage (a) from the 
same initial bank by a treatment with different enzymes. The use of such banks no longer 
requires the use of a consensus matrix during stage (b) or (c), since the fragments can be 
hybridized onto one another. 

The fragments of stage (a) of the process of the invention can also be generated by PGR 
reactions conducted on the polynucleotide sequences of the initial bank. Two solutions are 
conceivable. In a first case, the oligonucleotides can be designed so as to generate fragments 
whose ends are adjacent all along the assembly sequence. In a second case, the oligonucleotides 
are designed so as to generate fragments that have common sequences, whereby these fragments 
can be used as an assembly matrix for one another in stage (b) or in stage (c). 

The assembly matrix in stage (b) or (c) is, for example, a polynucleotide sequence that is 
obtained from the initial bank or a consensus sequence of said bank, single- or double-strand. In 
the case where the assembly matrix is incorporated directly into stage (c) of the invention, this 
matrix should be in single-strand form. 

According to a variant of the process of the invention, the assembly matrices of stage (b) 
or (c) consist of oligonucleotides. 

According to a particular embodiment of the process of the invention, single- or double- 
strand oligonucleotides of variable length are added at stage (c) in addition to the matrix. These 
oligonucleotides are designed to be able to be substituted for a portion of the fragments in stage 
(c), actually, their sequence is such that: 

- if they are completely homologous with the sequence of the fragment that they 
are replacing, they promote certain combinations, or 
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- if they are partially heterologous with the sequence of the fragment that they 
are replacing, they introduce one or more further directed mutations. 

Before stage (e) of the process of the invention, it is possible to separate the recombined 
polynucleotide sequences from the assembly matrix thanks to a marker that is present on the 
assembly matrix or on the recombined polynucleotide sequences. It is actually possible to mark 
each strand of the matrix according to techniques known by one skilled in the art. For example, 
the marker of the assembly matrix can be a hapten, and the recombined polynucleotide sequences 
of the assembly matrix are separated by techniques that are known to one skilled in the art, such 
as, for example, an anti-hapten antibody that is attached to a substrate or a biotin-streptavidin 
reaction, if the hapten is a biotin marker. 

Other techniques may be used to separate the recombined polynucleotide sequences from 
the assembly matrix. The assembly matrix can also be prepared specifically so as to facilitate its 
elimination at the end of the process of the invention. It can thus be synthesized by PGR 
amplification using methylated dATP, which allows its degradation by the Dpn I restriction 
endonuclease. In this case, the recombined polynucleotide sequences should not contain 
methylated dATP. The matrix may also have been prepared by PGR amplification by using 
dUTP, which allows its degradation by treatment with a uracyl-DNA-glycosylase. Gonversely, it 
is possible to protect the recombined polynucleotide sequences by amplifying them by selective 
PGR with oligonucleotides that carry phosphorothioate groups at 5'. Treatment with an 
exonuclease then makes it possible specifically to degrade the assembly matrix. 

As indicated above, the process of the invention can comprise, before cloning stage (e), a 
stage for amplification of recombined polynucleotide sequences. Any amplification technique is 
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acceptable, in particular an amplification by PGR. One of the simplest consists in producing a 
PGR that makes it possible to amplify specifically the recombined polynucleotide sequences 
thanks to primers that can be hybridized only onto the ends of recombined sequences. The PGR 
products are then cloned so that they can be characterized. 

The invention also relates to a process for generating polynucleotide sequences that are 
likely to have advantageous properties compared to reference sequences. This process consists in 
producing recombined polynucleotide sequences according to the process that is described above, 
then in screening by any suitable means the clones that are obtained for selecting those that 
comprise recombined polynucleotide sequences that have advantageous properties relative to 
reference sequences. For example, advantageous properties are defined as the thermostability of 
an enzyme or its capacity to be able to operate under conditions of pH or temperature or saline 
concentration that are more suitable to an enzymatic process than the reference proteins that are 
usually used for said process. By way of example of such a process, it is possible to cite an 
industrial process for desizing textile fibers or for bleaching paper pastes or for producing aromas 
in the dairy industry. 

According to an advantageous embodiment of this application of the process of the 
invention, the initial bank can therefore be the result of a screen that has made it possible to 
select by any suitable means the polynucleotide sequences that exhibit advantageous properties 
relative to reference sequences. The thus selected sequences constitute an initial limited bank. 

It is also possible, however, to start fi"om an initial unlimited bank so as to preserve the 
representative quality of the properties contained in this bank. 

The sequences that code for the proteins that have advantageous properties relative to the 
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reference proteins are then selected, by in-vivo and in-vitro screenings, and can be used in 
constituting a new bank for an optional reiteration of the process of the invention. 

Among the screening techniques that can be applied to each of the clones of stage (e), the 
in-vitro screening techniques offer the advantage of eliminating problems of cellular physiology 
and all the drawbacks that are linked to the in-vivo expression cloning, hi addition, this type of 
screening is easy to automate, which makes it possible to screen a high number of recombined 
polynucleotide sequences. 

The invention also relates to a recombined polynucleotide sequence that is obtained by a 
process according to the invention, as well as a vector that contains such a recombined 
polynucleotide sequence, a cellular host that is transformed by a recombined polynucleotide 
sequence or a vector of the invention, as well as a protein that is coded by this recombined 
polynucleotide sequence. The invention also comprises the corresponding banks of recombined 
polynucleotide sequences, vectors, cellular hosts or proteins. 

Other advantages and characteristics of the invention will emerge from the following 
embodiments of the invention that refer to the accompanying drawings, in which: 

Figure 1 is a diagrammatic representation of the processes of the prior art corresponding 
respectively to DNA shuffling (Figure lA) and to StEP (Figure IB). 

Figure 2 is a diagrammatic representation of an embodiment of the process of the 
invention and of some of its variants and application. 

Figure 3 represents the positions of the ten mutation zones {Pvu II and Pst I) carried by 
each mutant of the ponB gene that is used for the embodiments of the invention. 

Figure 4 represents the position of the primers that are used relative to the sequence of the 
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ponB gene. 

Figure 5 represents the migration of the products of the RLR and PGR reactions of the 
products of these RLR reactions to agarose gel. 

Figure 6 represents the position of the mutations relative to the restriction fragments. 

I-P R T NCT P T.R . 

The process of the invention was used starting from a bank of mutants of the ponB gene, 
coding for the PBPlb of E, coli (1). Ten mutants of this gene were used. The gene sequence of 
each mutant differs from that of the savage gene by a non-homologous zone of thirteen to sixteen 
bases consisting of the substitution of five initial codons by five alanine codons according to the 
technique described by Lafevre et al (8). 

The substitution carried by every mutant is characterized by the presence of a single site 
of the Pvu II restriction enzyme framed by two sites of the Pst I enzyme, which make it possible 
to distinguish the mutants from one another by their digestion profile with these restriction 
endonucleases. Figure 3 shows the positions of ten mutation zones {Pvu II and Pst I) that are 
carried by every mutant. 

After amplification by PGR of the genes of the ten mutants, the PGR products were 
purified, mixed in an equimolar amount to constitute the initial bank. The polynucleotide 
sequences of this bank were digested by restriction enzymes Hinf I and Bsa I, so as to generate 
restriction fragment banks. The restriction fragments were then incubated with the savage 
matrix, in different amounts, in the presence of a thermostable ligase. After several cycles of 
denaturation/hybridization/ligation, a fraction of this reaction mixture was used to carry out a 



PGR amplification with a pair of specific primers of ends 5' and 3' of the genes of mutants and 
non-specific genes of ends 5' and 3' of the savage matrix. The amplification product was cloned, 
and the clones obtained were analyzed for their digestion profile with restriction endonuclease 
Pvu II or Pst L The profiles obtained made it possible to determine which fi'agment(s) of the 
mutants was (were) able to be recombined with the others to reconstitute an entire gene. 

II -MATERIAL. 

1) Co lonies and Plasmids. 

The MC1061 colony (F" araDU9, A (ara-leu)7696, galElS, galK16, A (lac)X74, rpsL 
(Str ), mcrA mcrBI, hsdR2 (rk'mk )) is derived fi*om Escherichia coli K12. 

The pARAPONB vector is obtained firom the pARA13 vector (3) in which the ponB gene 
that carries a thrombin cutting site (9) was introduced between the Nco I and Nar I restriction 
sites. The pET26b+ vector is part of the family of pET vectors developed by Studier and Moffatt 
(12) and marketed by the NOVAGEN Gompany. 

2) Oligonucleotides. 

The oligonucleotides were synthesized by the ISOPRIM Gompany (Toulouse). The 
sequences of the oligonucleotides are reported in Table I below. 
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OligoN 


5' ACTGACTACCATGGCCGGGAATGACCGCGAGCC 3' 


Oligo E 


5' CCGCGGTGGAGCGAATTCTAATTACTACCAAACATATCC 3' 


Olieo Ml 


5' GCGCCTGAATATTGCGGAGAAAAAGC 3' 


Olieo M2 


5' ACAACCAGATGAAAAGAAAGGGTTAATATC 3' 


Olieo Al 


5' ACTGACTACCATGGCC 3' 


Oligo A2 


5' CCGCGGTGGAGCGAATTC 3' 



The enzymes of restrictions and modifications cited in Table II below were used 
according to the recommendations of the suppliers. 



Tflhle TT 



Enzyme 


Concentration 


Supplier 


Ncol 


10U/|al 


NEB 


PstI 


20 U/|jl 


NEB 


EcoRI 


20 U/^1 


NEB 


Bsal 


5U/^il 


New England Biolabs 


Hinfl 


lOUV 


New England Biolabs 


PvuU 


lOUV 


New England Biolabs 


T4 DNA ligase 


400 U/^1 


New England Biolabs 


Taq DNA polymerase 


5 Will 


PROMEGA 


AMPLIGASE 


lOOU/^1 


EPICENTRE 



The buffers that are used are reported in Table III below. 



Tahlp TTT 



Buffers 


Composition 


T 


10 mmol or ins HCl, pH S.O 


20X Polymerization 


100 mmol of Tris HCl, pH 8.3, 15 mmol of 
MgCh, 500 mmol oi K.C1, 1.0% Inton XlOO 


lOX Restriction A 


500 mmol of NaCl, 100 mmol of Tris HCl, pH 
7.9, 100 mmol of MgCb, 10 mmol of DTT 


lOXRestnctionB 


1 M NaCl, 500 mmol of Tns HCl, pH 7.9, 100 
mmol of MgCb, 10 mmol of DTT 


1 OX Restriction C 


500 mmol of NaCl, 1 M Tris HCl, pH 7.5, 100 
mmol 01 MgL^i2, \).zj /o inion Aiuu 


lOX AMPLIGASE 


200 mmol of Tris HCl, pH 8.3, 250 mmol of 
KCl, 100 mmol of MgCb, 5 mmol of DNA, 
0.1% Triton XlOO® 


1 OX LIGATION 


500 mmol of Tris HCl, pH 7.5, 100 mmol of 
MgCb, 100 mmol of DTT, 10 mmol of ATP, 
250 ^g/ml of BSA 
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III - PRFPAR ATTON OF THF MATRIX 

The savage ponB gene was amplified by a PGR reaction stage by using the Ml and M2 
oligonucleotides as primers (Fig. 4). Five PGR reactions were prepared by adding 50 ng of the 
pPONBPBR plasmid that carries savage gene (7) with a mixture that contains 10 |al of 
polymerization buffer, 10 |li1 of 2 mmol dNTPs, 20 pmol of each Ml and M2 oligonucleotide and 
5U of Taq DNA polymerase, in a final volume of 100 |il. These mixtures were incubated in a 
Perkin-Elmer 9600 thermal cycler according to the following program: (94°G - 2 minutes) - 
(94°G, 15 seconds - 60°G, 30 seconds - 72°G, 1 minute) x 29 cycles - (72°G - 3 minutes). 

The product of the five PGR was mixed and deposited on a 1% TBE agarose gel. After 
migration and coloration with the ethidium bromide of the gel, the band at 265 1 pb, 
corresponding to the amplification product of the ponB gene firamed by two fi*agments of 26 pb 
and 90 pb respectively, was visualized by ultraviolet trans-illumination and cut with a scalpel to 
be purified with the Quiaquick (QIAGEN) system. The complete DNA thus purified was eluted 
in 120 jil of T buffer. The concentration of this DNA was evaluated by a spectrophotometric 
dosage at 260 nm, at 100 ng/|al. 

TV - PRFP AR ATTON OF THF RANK 
1) Amplification of M utant Genes. 

The genes of the ten mutants were amplified separately via a PGR reaction by using the 
N and E oligonucleotides. These oligonucleotides respectively introduce the Nco I and Eco RI 
restriction sites, making it possible to clone the products that are obtained with these two sites. 

Each PGR reaction was prepared by adding 50 ng of the plasmid that carries the mutant 
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gene with a mixture that contains 10 |il of polymerization buffer, 10 jil of 2 nmiol dNTPs, 20 
pmol of each N and E oHgonucleotide, and 5U of Taq DNA polymerase, in a final volume of 100 
|il. This mixture was incubated in a Perkin-Elmer 9600 thermal cycler according to the 
following program: (94°C - 2 minutes) - (94°C, 1 5 seconds - 60°C, 30 seconds - 72°C, 1 
minute) x 29 cycles - (72°C - 3 minutes). 

The specificity of the gene amplification was verified by restriction profile with the Pvu II 
endonuclease by incubating 5 |il of each PGR product for 1 hour at 37°C in a mixture that 
contains 3 of restriction buffer A and 5U of the Pvu II enzyme in a final volume of 30 |il. 15 
|il of this digestion reaction was deposited on a 1% TBE agarose gel. After migration and 
coloration with ethidium bromide, the gel was exposed to ultraviolet. The visualization of the 
restriction fi*agments made it possible to confirm the specificity of the gene amplification of each 
mutant gene. 

Concurrently, 3 jal of each PGR reaction was deposited on a 1% TBE agarose gel. After 
migration, the gel was treated as above. The intensity of each band made it possible to estimate 
that the gene amplifications had had the same yield. 

2) rreation of Restriction Fragment Ranks , 

50 |il of each of the ten PGR was mixed and deposited on a 1% TBE agarose gel. 
After migration and coloration with ethidium bromide, the band at 2572 pb, corresponding to the 
product for amplification of the genes of the ten mutants, was cut with a scalpel and purified with 
the Quiaquick (QIAGEN) system. The entire DNA thus purified was eluted in 120 jal of T 
buffer. The concentration of this DNA was evaluated by a spectrophotometric dosage at 260 nm, 



at 100 ng/jal. 

To generate the restriction fragment banks, 100 |il of this DNA was incubated for one 
hour at 50°C in a mixture that contains 12 ^1 of restriction buffer B, 1.2 |li1 of BSA (with 10 
mg/ml), 25 U of the Bsa I enzyme, and 4 jil of water. Then, 2 yX of restriction buffer B, 2 |Lil of 
BSA (with 1 mg/ml), 50 U of the Hinfl enzyme and 1 1 .5 |il of water were added to the mixture, 
which was incubated for one hour at 37°C. The entire digestion mixture was purified on a 
QIAquick (QIAGEN) colunrn and eluted with 30 |il of buffer T. 1 yX of this eluant was deposited 
on 1% TBE agarose gel to verify that the digestion had been total and that it generated 6 
restriction firagments, and consequently six fi-agment banks, of 590 pb, 500 pb, 472 pb, 438 pb, 
298 pb, and 274 pb. The concentration of this DNA was evaluated (by a spectrophotometric 
dosage at 260 nm) at 250 ng/|il. 

V- RT .R (Recombinin g li gation Reaction). 

The RLR (Recombining Ligation Reaction) reaction was carried out by incubating 
determined amounts of Hinfl - Bsa I restriction fi"agments of genes of the ten mutants with the 
complete matrix (i.e., the savage ponB gene), in the presence of a thermostable DNA ligase. 
Table IV below reports the composition of the RLR mixtures. 
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Table TV 





RLR 1 


RLR 2 


RLR 3 


RLR 4 


T- 


Hinfl - Bsa I Fragments of the Ten 
Mutants f 100 ne/jiH 


0.5^1 


1 jil 


2^1 


5^1 


5^1 


Savage ponB Matrix (100 ng/^1) 


0.6 ,a 


1.2 ,il 


2.4 ,il 


6,il 


6,il 


lOX AMPLIGASE Buffer 


2|j1 


2,11 


2,xl 


2,il 


2,.l 


AMPLIGASE (25 U/jiD 




1 |il 








H2O 


Sufficient 
quantity 
for 20liI 


Sufficient 
quantity 
for 20li1 


Sufficient 
quantity 
for20ul 


Sufficient 
quantity 
for 20^1 


Sufficient 
quantity 
for 20^il 



The negative control is identical to the RLR4 reaction, but it does not contain 
thermostable DNA ligase. These different mixtures were recovered from a drop of mineral oil 
and incubated in a Perkin-Elmer 9600 thermal cycler in 200 jal microtubes according to the 
following program: (94°C, 5 minutes) - (94°C, 1 minute - 65°C, 4 minutes) x 35 cycles. 

10 (il of each RLR reaction was then added to a PGR reaction mixture that contains 10 jil 
of polymerization buffer, 10 |il of 2 nrmiol dNTPs, 40 pmol of each Al and A2 oligonucleotide, 
and 5 U of Taq DNA polymerase in a final volume of 100 This mixture was incubated in a 
Perkin-Elmer 9600 thermal cycler according to the following program: (94°C, 5 minutes) - 
(94°C, 30 seconds - 46°C, 30 seconds - 72°C, 1 minute) x 29 cycles - (72^C, 2 minutes). This 
PGR reaction made it possible to amplify specifically the ligation products that are formed during 
the RLR reaction, without amplifying the matrix, then the Al and A2 oligonucleotides cannot be 
hybridized onto the latter, as shown in Figure 4. 

5 )il of each RLR reaction and 10 |il of each of the preceding PGR reactions was 
deposited on a 1% TBE agarose gel. After coloration with ethidium bromide, the gel was 
exposed to ultraviolet, as shown in Figure 5. 
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The analysis of this gel reveals that only the RLR4 reaction contains, as the negative 
control, restriction fragments that are still visible (tracks 4 and 5). 

The absence of the PGR product for the negative control (track 10) reveals not only that 
the PGR reaction is specific (no amplification of the complete matrix) but also that the restriction 
fragments that are present in the mixture cannot be substituted with primers to generate a 
contaminating PGR product under the selected conditions. Goncurrently, the presence of a single 
band with about 2500 pb on tracks 6, 7 and 8 demonstrates that an RLR product could be 
amplified by PGR for the RLRl , 2 and 3 reactions. These three RLR reactions were therefore 
allowed to reconstitute one or more complete genes from six banks of restriction fragments. 

VT - ANAT.V^TS OF PT^ODTTrTR FOR AMPT JFirATTON OF THF. RTR 
1) rioning . 

The PGR amplification products of the RLR 1, 2 and 3 reactions were purified with the 
Wizard PGR Preps System (PROMEGA) and eluted in 45 of buffer T. 6 |al of each purified 
PGR was incubated for 1 hour at 37°G in a mixture that contains 3 |li1 of restriction buffer G, 3 [il 
of BSA (at I mg/ml), 20 U of the Eco RI enzyme, 10 U of the Nco I enzyme, and 15 |4.1 of water. 

Goncurrently, two vectors (pARAPONB and pET26b+) were prepared for the cloning. 
These vectors were linearized by incubating 3 |ag of these plasmids for 2 hours at 37°G, in a 
mixture that contains 3 |al of restriction buffer G, 3 ^1 of BSA (at 1 mg/ml), 20 U of the Eco RI 
enzyme, 10 U of the Nco I enzyme and 19 |al of water. 

The linearized vectors as well as the digested PGR were purified on 1% TBE agarose gel 
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with the QIAquick (QUIAGEN) System. Each vector or each digested PGR was eluted in 30 
of buffer T. 

The hgation of each PGR digested with one or the other of the vectors was carried out 
according to the conditions that are described in Table V below, and incubated at 16°G for 16 
hours. 



Table V 





Ligation with the pARAPONB Vector 


Ligation with the pET26b+ Vector 




LpARl 


LpAR2 


LpAR3 


TlpAR 


LpETl 


LpET2 


LpET3 


TLpET 


PGR 

Amplification 
of the 

digestea klk 
1 Nco I - Eco 
-RI 










4^1 








PGR 

Amplification 
of the 

digested RLR 
2 Nco I - Eco 
RI 




4|al 








4^1 






PGR 

Amplification 
of the 

digested RLR 
3 Nco I - Eco 
RI 






4^1 








4^1 




Nco I- Eco RI 
-digested 
pARAPONB 
Vector 


1^1 




1^1 


1 ^il 










Nco I- Eco RI 

digested 

DET26b+ 










1 lal 


1^1 


1^1 


ipii 
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Ligation with the pARAPONB Vector 


Ligation with the pET26b+ Vector 




LpARl 


LpAR2 


LpAR3 


TlpAR 


LpETl 


LpET2 


LpET3 


TLpET 


Vector 


















Ligation 
Buffer 




2^1 


2^1 


2^1 


2^1 


2^1 


2^1 


2^1 


Lipase 


-M— 


1 III 


_l4iL_ 


1 lil 


l^il 




1 Ml 


1^1 


H2O 


12 111 


12 ul 


12 ul 


16^1 


12fil 


12^1 


12^1 


16)il 



200 [4,1 of chemocompetent MCI 061 cells (4) were transformed with 10 |li1 of each 
ligation by a thermal shock (5), and the thus transformed cells were smeared on a selection 
medium. 

No clone was obtained after transformation of the TL pAR and TL pET ligation controls, 
thus indicating that Nco I - Eco RI linearized pARAPONB and pET26b+ vectors cannot undergo 
intramolecular ligation. 

2) Screening hy PCR 

A first screening of clones obtained after transformation of ligations with the 
pARAPONB vector was carried out by PGR. 42 colonies, 14 of each LpARl, LpAR2 and 
LpAR3 ligation, were resuspended individually in a PGR mixture containing 5 |li1 of 
polymerization buffer, 40 pmol of each Al and A2 oligonucleotide, 5 j^l of 2 mmol dNTPs and 
5U of Taq DNA polymerase in a final volume of 50 (il. A negative control was constituted by 
adding to the PGR mixture 50 ng of pBR322 plasmid instead of a colony. These 43 tubes were 
incubated in a Perkin-Elmer 9600 thermal cycler according to the following program: (94°G, 5 



minutes) - (94°C, 30 seconds - 46°C, 30 seconds - 72°C, 1 minute) x 29 cycles - (72°C, 2 
minutes). 5 ^l of each of these PGR reactions was then incubated for 1 hour at 37°C in a mixture 
containing 2 |il of restriction buffer A, 2 (il of BSA (at 1 mg/ml) and 5 U of Pvu II restriction 
enzyme in a final volume of 20 

10 ^1 of each of these digestions was deposited on a 1% TBE agarose gel concurrently 
with 5 |al of each non-digested PGR (which makes it possible not to combine possible non- 
specific bands of the PGR with a fi-agment that is obtained by restriction digestion). 

After migration and coloration with ethidium bromide of this gel, the bands that are 
obtained from the digestion by the Pvu II enzyme were analyzed so as to determine what 
fragment(s) of the initial mutants were associated with the others to reconstitute an entire gene. 
This screening reveals the presence of 27 genes that support a mutation, 7 genes that support two 
mutations, and 8 genes that no longer support mutation. 

3) Screening hy Mini -Preparation of Plasmid DNA . 

The second screening was carried out by performing an extraction from plasmid DNA (5) 
of 21 clones obtained from the transformation of ligations with the pET26b+ vector (7 clones of 
each ligation). 5 j^l of the thus obtained plasmid DNA for each clone was incubated for 1 hour at 
37°G in the presence of a mixture that contains 1 \J^ of restriction buffer G, 6 U of the Pst I 
enzyme, 3 U of the Nco I enzyme and 6 U of the Eco RI enzyme in a final volume of 10 ^1. 5 |il 
of each of these digestions was deposited on a 1% TBE agarose gel. After migration and 
coloration with ethidium bromide of this gel, the bands that are obtained from the digestion by 
the Pst I enzyme were analyzed so as to determine what fragment(s) of the initial mutants were 
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associated with the others to reconstitute an entire gene. This screening reveals the presence of 
13 genes that support a mutation, 5 genes that support two mutations and 3 genes that no longer 
support mutation. 

4) Stnfktical Analysis o f Remmhi nation 

Based on the position of each mutation relative to the cutting sites of the Hinf I and Bsa I 
enzymes, as shown in Figure 6, it is possible to calculate the probability of obtaining the creation 
of a gene that carries 0, 1, 2, 3 or 4 of the mutations of the initial genes during the RLR reaction. 

Thus, by considering that the RLR reaction is totally random, probabilities P are as 



follows: 



P(D mutation) 




P(l mutation) = 




7a 




P{3 mutations) 



P{2 mutations) = 




The two screenings that are carried out provide results that are close to these statistical 



forecasts, as reported in Table VI below, thus indicating that the RLR reaction is almost random. 
A slightly larger proportion of genes that support a mutation, to the detriment of the genes that 
support zero mutation, is observed. This phenomenon could be attributed to a low toxicity of the 
ponB gene already observed and to the slight shift of the pARAPONB and pET26b+ expression 
vectors that would promote the selection of genes that support an inactivating mutation. 



Table VT 



% 


0 

Mutation 


1 

Mutation 


2 

Mutations 


3 

Mutations 


4 

Mutations 


Statistics 


30.24 


44.04 


21.44 


4.04 


0.24 


PCR 
Screening 


21 


63 


16 


0 


0 


Mini- 
preparation 
Screening 


14 


62 


24 


0 


0 
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CLAIMS 

1) Process for in-vitro production of recombined polynucleotide sequences starting from 
a bank of polynucleotide sequences, characterized in that it comprises the following stages: 

a) the fragmentation of an initial bank of double-strand polynucleotide sequences, 

b) the denaturation of fragments obtained from stage (a) optionally in the presence of 
one or more assembly matrices, 

c) the hybridization of said fragments in the presence of one or more assembly matrices 
if the latter is not (are not) present in stage (b), 

d) the ligation of said fragments, 

e) the cloning of the recombined polynucleotide sequences. 

2) Process according to claim 1, wherein it comprises the repetition of stages (b), (c) and 
(d) at the end of stage (d). 

3) Process according to one of claims 1 or 2, wherein it comprises the separation of the 
recombined polynucleotide sequences of the assembly matrix or matrices before stage (e). 

4) Process according to one of claims 1 to 3, wherein it comprises the amplification of 
the double-strand recombined polynucleotide sequences before cloning stage (e). 

5) Process according to any of claims 1 to 4, wherein the ends of the fragments generated 
in stage (a) are such that there can be adjacent hybridization of these ends onto the assembly 
matrix or matrices in stage (c) and ligation of these fragments with one another in stage (d). 

6) Process according to any of the preceding claims, wherein the polynucleotide 
sequences of the initial bank have homology zones either between one another or with the 
assembly matrices, making it possible to generate ends of fragments that make possible the 



adjacent hybridization of these ends onto the assembly matrix or matrices in stage (c) and 
ligation of these fragments with one another in stage (d). 

7) Process according to any of the preceding claims, wherein stages (c) and (d) are 
carried out simultaneously. 

8) Process according to any of the preceding claims, wherein the degree of recombination 
and the position of the recombination points are controlled by the fragmentation of stage (a). 

9) Process according to any of the preceding claims, wherein the initial bank of stage (a) 
is generated from a savage gene by mutagenesis stages that are directed in succession by random 
mutagenesis, or by combining genes of close or separate families within the same type or of 
different types so as to use a variety of polynucleotide sequences in the initial bank. 

10) Process according to any of the preceding claims, wherein stage (a) consists in 
subjecting the initial bank to hydrolysis by the action of one or more restriction enzymes. 

1 1) Process according to any of the preceding claims, wherein stage (a) consists in 
subjecting the initial bank to hydrolysis by action of restriction enzymes that have a large number 
of cutting sites on the polynucleotide sequences of the initial bank or by combining several 
restriction enzymes. 

12) Process according to any of the preceding claims, wherein in stage (a), fragments are 
prepared whose size is greater than or equal to 40 pb. 

13) Process according to any of the preceding claims, wherein stage (a) consists in 
generating a bank of fragments by random treatment with the Dnase I of the initial bank of 
double-strand polynucleotide sequences. 

14) Process according to any of the preceding claims, wherein stage (b) is carried out by 
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combining at least two banks of separate fragments that are generated separately in stage (a) from 
the same initial bank by a treatment with different enzymes. 

15) Process according to any of the preceding claims, wherein the fragments that are 
generated in stage (a) are generated by PGR starting from (a) polynucleotide sequence(s) of the 
initial bank. 

16) Process according to one of claims 13 to 15, wherein the fragments that are obtained 
in stage (a) are used as an assembly matrix for one another in stage (b) or in stage (c). 

17) Process according to any of claims 1 to 15, wherein the assembly matrix of stage (b) 
or (c) of the process is a polynucleotide sequence that is obtained from the initial bank or a 
consensus sequence of said single- or double-strand bank. 

18) Process according to any of claims 1 to 15, wherein the assembly matrices of stage 
(b) or (c) of the process consist of oligonucleotides. 

19) Process according to any of claims 1 to 1 8, wherein stage (c) is carried out in the 
presence of single- or double-strand oligonucleotides of variable length. 

20) Process for in-vitro production of recombined polynucleotide sequences that exhibit 
advantageous properties relative to reference sequences, wherein a process is carried out 
according to any of claims 1 to 19, then wherein the clones that are obtained are screened by any 
suitable means to select those that comprise polynucleotide sequences that exhibit advantageous 
properties relative to reference sequences. 

21) Process according to claim 20, wherein the screening of the clones is carried out in 

vitro, 

22) A recombined polynucleotide sequence that is obtained by the process of the 



invention according to any of claims 1 to 21. 

23) A vector that contains a recombined polynucleotide sequence according to claim 22. 

24) A cellular host that is transformed by a recombined polynucleotide sequence 
according to claim 22 or by a vector according to claim 23. 

25) A protein that is coded by a recombined polynucleotide sequence according to claim 

22. 

26) A bank that consists of recombined polynucleotide sequences according to claim 22, 
or vectors according to claim 23, or cellular hosts that are transformed according to claim 24, or 
proteins according to claim 25. 
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PROCESS FOR IN'VITRO PRODUCTION OF RECOMBINED POLYNUCLEOTIDE 
SEQUENCES, SEQUENCE BANKS AND SEQUENCES THAT ARE THUS OBTAINED. 



This invention relates to a method for in-vitro production of recombined polynucleotide 
sequences that comprise the following stages: (a) the fragmentation of an initial bank of double- 
strand polynucleotide sequences, (b) the denaturation of fragments obtained from stage (a) 
optionally in the presence of one or more assembly matrices, (c) the hybridization of said 
fragments in the presence of one or more assembly matrices if the latter is not (are not) present in 
stage (b), (d) the ligation of said fragments, and (e) the cloning of recombined polynucleotide 
sequences. 
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Figure 1 B 

(Process carried out in a double strand, but illustrated here on a single strand) 
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[Key to Figure 2:] 

Banque initiale = Initial Bank 

Selection = Selection 

Banque restreinte = Limited Bank 

OU = OR 

Sous banques = Under Banks 

Matrice d'assemblage = Assembly Matrix 
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Fig. 2 Continuation 1 
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[Key to Figure 2, Continuation 1 :] 

Fragments se rehybridant sur eux-memes = Fragments that are rehybridized onto themselves 
Fragments s'hybridant sur la matrice = Fragments that are hybridized onto the matrix 
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Figure 2, Continuation 2 
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[Key to Figure 2, Continuation 2] 

RETOUR EVENTUEL EN b, c et d = POSSIBLE RETURN TO b, c and d 
Separation des sequences recombinees de la matrice d'assemblage = Separation of the 
recombined sequences of the assembly matrix 

Amplification des sequences recombinees = Amplification of recombined sequences 

e) Clonage des sequences recombinees = Cloning of recombined sequences 

CRIBLAGE des sequences recombinees = SCREENING of recombined sequences 

Famille de sequences polynucleotidiques recombinees = Family of recombined polynucleotide 

sequences 

RETOUR EVENTUEL A L'ETAPE a/ = POSSIBLE RETURN TO STAGE a/ 
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[Key to Figure 3:] 

Position des dix zones de mutations (sites Pvu II et Pst I) = Position of ten mutation zones {Pvu 11 
and Pst I sites) 
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[Key to Figure 4:] 

Position des amorces utilisees par rapport a la sequence du gene ponB = Position of primers 
used relative to the sequence of the ponB gene 
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Figure 5: Migration of the RLR reactions and the PGR amplifications of these reactions 
Tracks: 1/ RLR 1 6/ PCR RLR 1 

2/ RLR 2 7/ PCR RLR 2 

3/ RLR 3 8/ PCR RLR 3 

4/ RLR 4 9/ PCR RLR 4 

5/ RLR Control 10/ RLR Control PCR 



[Key to Figure 6:] 

Position des mutations par rapport aux fragments de restriction = Position of Mutations Relative 
to Restriction Fragments 



